Speech Recognition Timeline - Concepedia

Concepedia

Concept

Speech Recognition

Variants

Automatic Speech Recognition

Parents

Speech Sciences

Children

Atypical SpeechAudio Signal AnalysisNatural Language Generation (Natural Language Processing)Natural Language Generation (Speech Language Pathology)Speech Acquisition

72.1K

Publications

3.9M

Citations

110.6K

Authors

9.9K

Institutions

Dynamic Time Warping Alignment

1956 - 1985

The period centers on time alignment and similarity measures, with Dynamic Time Warping (DTW) enabling time-normalized matching and dynamic programming–driven time-warping guiding word-level alignment across utterances. Front-end representations grounded in Linear Predictive Coding (LPC) and cepstral analysis, together with formant and pitch estimation, yield compact, trainable features that support predictive coding and excitation modeling. Vector quantization and early statistical pattern recognition shape the ASR pipeline, while Hidden Markov Models (HMMs) begin to emerge for speaker-independent isolated-word recognition, shaping model-based approaches. Acoustic cues such as spectral formants, cepstral pitch, and voicing inform feature extraction and decision rules.

• Time alignment and similarity measures became the central paradigm for speech recognition, with Dynamic Time Warping (Dynamic Time Warping, DTW) enabling time-normalized matching and dynamic programming-based time-warping guiding word-level alignment across utterances. [9], [16], [7], [12], [10].

• Front-end representations grounded in linear prediction, cepstral analysis, and formant/pitch estimation provided compact, trainable features for recognition, enabling predictive coding and excitation modeling via Linear Predictive Coding (LPC) and cepstrum-based methods. [1], [19], [4], [5], [3].

• Vector quantization and statistical pattern recognition shaped early automatic speech recognition pipelines, with Vector Quantization (VQ) design, LPC-front ends, and emerging integration of Hidden Markov Models for speaker-independent isolated word recognition. [6], [17], [20], [8].

• Acoustic-phonetic cue research established spectral formants, cepstral pitch, and voicing cues as foundational signals for speech perception and recognition, informing feature extraction and decision rules. [5], [4], [15], [2].

Popular Keywords

speech processing

speech perception

speech communication

[1]

Speech Analysis and Synthesis by Linear Prediction of the Speech Wave

1971 • signal processing, speech communication, speech perception, speech processing

[2]

A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements

1964 • phonetics, phonology, speech communication, speech perception, speech processing, speech production

[3]

Minimum prediction residual principle applied to speech recognition

1975 • phonetics, signal processing, speech communication, speech perception, speech processing, speech technology

[4]

System for Automatic Formant Analysis of Voiced Speech

1970 • phonetics, signal processing, speech perception, speech processing, speech technology

[5]

Cepstrum Pitch Determination

1967 • phonetics, speech communication, speech perception, speech processing, speech technology

[6]

Speech coding based upon vector quantization

1980 • signal processing, speech communication, speech perception, speech processing

[7]

Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition

1979 • speech communication, speech perception, speech processing, speech technology

[8]

Design of a linguistic statistical decoder for the recognition of continuous speech

1975 • phonetics, phonology, speech communication, speech perception, speech processing, speech technology

[9]

Considerations in dynamic time warping algorithms for discrete word recognition

1978 • phonetics, phonology, speech communication, speech perception, speech processing, speech technology

[10]

Distortion measures for speech processing

1980 • phonetics, signal processing, speech communication, speech perception, speech processing, speech technology

[11]

Speaker-independent recognition of isolated words using clustering techniques

1979 • phonetics, speech communication, speech perception, speech processing

[12]

Distance measures for speech processing

1976 • phonetics, phonology, signal processing, speech communication, speech perception, speech processing, speech technology

[13]

Effects of Stimulus Content and Duration on Talker Identification

1966 • phonetics, speech communication, speech perception, speech processing, speech production, speech science, speech technology

[14]

Digital speech networks

1977 • signal processing, speech communication, speech perception, speech processing, speech technology

[15]

Role of formant transitions in the voiced-voiceless distinction for stops

1974 • phonetics, phonology, speech communication, speech perception, speech processing, speech production, speech science

[16]

Dynamic programming algorithm optimization for spoken word recognition

1978 • speech communication, speech perception, speech processing, speech technology

[17]

An Algorithm for Vector Quantizer Design

1980 • signal processing, speech processing

[18]

Digital coding of speech waveforms: PCM, DPCM, and DM quantizers

1974 • phonetics, phonology, signal processing, speech communication, speech perception, speech processing

[19]

Adaptive Predictive Coding of Speech Signals

1970 • signal processing, speech communication, speech perception, speech processing

[20]

On the Application of Vector Quantization and Hidden Markov Models to Speaker-Independent, Isolated Word Recognition

1983 • speech communication, speech perception, speech processing

Time-Delay Neural Network Era

1986 - 2001

Neural Sequence Modeling Emergence

2002 - 2008

Deep Neural Acoustic Modeling

2009 - 2015

Self-Supervised End-to-End Speech

2016 - 2024